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FUNGAL CELLS WITH INACTIVATED DNA MISMATCH REPAIR SYSTEM 



5 FIELD OF INVENTION 

A process for making DNA libraries in filamentous fungal 
cells using a novel cloned gene involved in the mismatch repair 
system of filamentous fungal cells. 

10 BACKGROUND OF THE INVENTION 

The mismatch repair system is a system within cells which 
recognises mismatches in newly synthesised duplex DNA 
sequences . 

The mismatch repair system then either corrects the 
is mismatches which are seen as errors by e.g. using the 

methylated "old' 1 strain as template or alternatively it may 
mediate degradation of the duplex DNA sequences which comprise 
the mismatches . 

Independently on the precise mechanism the end result 
20 will be that the "mismatch repair system 1 1 will limit the 
""diversity* 1 within a cell, diversity being represented as 
duplex DNA sequences which comprise mismatches. 

For example a duplex DNA sequence which comprises a 
single mismatch represents a diversity of two different DNA 
25 sequences within the cell. If the mismatch repair system 
corrects the mismatch there will only be a diversity of one 
within the cell. 

Alternatively, if the mismatch repair system mediates the 
degradation of such a duplex DNA sequence the diversity will be 
30 lost. See figure 1 for a graphic illustration on how the ■ 
mismatch repair system may work within a cell. 

Consequently, if duplex DNA sequences comprising 
mismatches represent a DNA library of interest, then the 
diversity of this library may be limited when transformed 
35 (placed) into cells with an active mismatch repair system. 

The art provides a solution to this problem by making 
cells wherein the mismatch system is inactive. 
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EP 44 9923 describes bacterial cells wherein the mismatch 
system is inactivated. 

WO 97/37011 describes yeast cells wherein the mismatch 

system is inactivated. See the working examples of this 
document . 

WO 97/05268 describes mice cells wherein the mismatch 

system is inactivated. See the working examples of this 
document . 



10 SUMMARY OF THE INVENTION: 

The problem to be solved by the present invention is to 
provide an improved strategy for making DNA libraries in 
filamentous fungal cells. A filamentous fungal cell population 
comprising such a DNA library may then be used to select a 
is polypeptide of interest. Also polynucleotide sequences with 
particular properties might be selected, such as promoters, 
terminators and other regulatory elements with changed/ improved 
properties . 

The solution is based on that the present inventors have 
20 cloned a NOVEL gene involved in the mismatch repair system of a 
filamentous fungal cell. Further, this gene is the first gene 
cloned which is involved in the mismatch repair system of a 
filamentous fungal cell. 

By inactivating this gene in a filamentous cell it is 
25 possible to obtain a filamentous cell which is deficient in its 
mismatch repair system and which is highly useful for preparing 
DNA libraries in filamentous fungal cells. 

The gene comprises a very characterising DNA sequence 
encoding the polypeptide sequence shown in positions 683-758 of 
30 SEQ ID NO 2. 

This DNA has been used to clone the full length gene 
encoding the polypeptide sequence shown in positions 1-940 of 
SEQ ID NO 2 . See working examples herein (vide infra) . 

The gene cloned as described in working examples herein 
35 is a gene cloned from an Aspergillus oryzae filamentous fungal 
cell . 

However, based on the novel sequence information provided 
herein it is routine work for the skilled person to clone 
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similar homologous genes from other filamentous fungal cells by 
e.g. standard hybridisation or PCR technology, preferably by 
using the DNA sequence encoding the polypeptide sequence shown 
in positions 683-758 of SEQ ID NO 2 as a basis for making a 
5 hybridisation probe or PCR primers . 

Accordingly, in a first aspect the present invention 
relates to a filamentous fungal cell, wherein a gene involved 
in the mismatch repair system has been inactivated and in which 
the gene involved in the mismatch repair system comprises: 
10 (a) a DNA sequence encoding the polypeptide sequence 

shown in positions 683-758 of SEQ ID NO 2; or 
(b) a DNA sequence encoding a polypeptide sequence which* 
is at least 70% identical to the polypeptide sequence 
shown in positions 683-758 of SEQ ID NO 2; 

is and 

in a second aspect the present invention relates to a 
filamentous fungal cell, wherein a gene involved in the 
mismatch repair system has been inactivated and in which the 
gene involved in the mismatch repair system comprises: 
20 (a) a DNA sequence encoding the polypeptide sequence 

shown in positions 1-940 of SEQ ID NO 2; or 
(b) a DNA sequence encoding a polypeptide sequence which 
is at least 70% identical to the polypeptide sequence 
shown in positions 1-940 of SEQ ID NO 2. 

25 

As stated above a filamentous fungal cell of the first or 
second aspect of the invention is very suitable for making a 
DNA library of interest in filamentous fungal cells. 

Accordingly, in a third aspect the present invention 
30 relates to a process for preparing a filamentous fungal cell 
population wherein individual cells in the population comprise 
individually different DNA sequences of interest representing a 
DNA library of interest comprising following steps: 

(a) placing individually different DNA sequences of 
35 interest in a filamentous fungal cell population 

comprising a filamentous fungal cell of the first or 
second aspect of the invention; and 

(b) growing the population of (a) for a period of time 
allowing an individual DNA sequence of interest in the 
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population to be duplicated at least once under 
conditions wherein the mismatch repair system gene of 
the first or second aspect of the invention has been 
inactivated . 

5 

One of the advantages of allowing an individual mismatch 
repair inactivated filamentous fungal cell duplicated DNA of 
interest at least once as descried under step (b) of the third 
aspect is illustrated in figure 1. As can be seen in figure 1 

10 the process of the third aspect using a filamentous fungal mis- 
match repair inactivated cell as described herein allows prepa- 
ration of a DNA library wherein eventual hetero duplex DNA mis- 
matches are not corrected. This gives a DNA library with a 
higher diversity as compared to a DNA library made in a fila- 

15 mentous fungal cell NOT having an inactivated mismatch repair 
system (see figure 1) . Duplication of DNA sequence of interest 
means that the two strands are replicated such that two sepa- 
rate sets of double stranded DNA are generated, each being 
based on a separate one of the two original strands. 

20 A filamentous fungal cell population wherein individual 

cells in the population comprise a DNA library of interest as 
described above may be used to select a polypeptide of 
interest . 

Accordingly, in a fourth aspect the present invention 
25 relates to a process for production of a polypeptide of 
interest comprising the steps of the third aspect and wherein 
the DNA sequences of interest encode a polypeptide of interest 
and which further comprises following step: 

(c) selecting from the resultant population of 
30 filamentous fungal cells of step (b) of the third 

aspect a desired polypeptide of interest. 
An advantage of the process of the fourth aspect may be 
that the polypeptide of interest is selected from a filamentous 
fungal cell expressing the polypeptide. Consequently, it is 
35 directly known that the polypeptide can be expressed from a 
filamentous fungal cell, which is usefully if it is 
subsequently required to produce the polypeptide in large scale 
in a filamentous fungal cell. This may be of particular 
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interest when the DNA library encodes polypeptides of interest 
which are derived from filamentous fungal cells, since it is 
known that filamentous fungal polypeptides preferably are 
produced in industrial relevant high yields in filamentous 
5 fungal cells. 

This is contrary to a similar selection process using 
e.g. a yeast cell. Here the only thing known is that the 
selected polypeptide is capable of being expressed in yeast and 
later expression a filamentous fungal cell might give problems, 
io especially if high yields are* required. 

DEFINITIONS : 

Following section provides definitions of technical 
features in above-mentioned aspects of the invention. 

is The term "' a gene" denotes herein a gene (a DNA sequence) 

will is capable of being expressed into a polypeptide within 
said cell. Accordingly, said gene sequence will be defined as 
an open reading frame starting from a start codon (normally 
"ATG 1 1 , "GTG" , or "TTG 1 ' ) and ending at a stop codon 

20 (normally "TAA" , TAG 1 ' or TG A 1 1 ) . 

In order to express said gene there must be elements, as 
known in the art, in connection with the gene, necessary for 
expression of the gene within the cell. Such standard elements 
may include a promoter, a ribosomal binding site, a termination 

25 sequence, and may be others elements as known in the art. 

The term " mismatch repair system 1 1 shall herein be 
understood according to the art, as a system within cells which 
recognises mismatches in duplex DNA sequences. See e.g. WO 
97/37011, page 1, line 21-28) 

30 The mismatch repair system then either corrects the 

mismatches which are seen as errors by e.gr. using the 
methylated "old" strain as template or alternatively it may 
mediate degradation of the duplex DNA sequences which comprise 
the mismatches. 

35 Independently on the precise mechanism the end result 

will be that the "mismatch repair system" will limit the 
"diversity" within the cell represented by such duplex DNA 
sequences which comprise mismatches. 
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For example a duplex DNA sequence which comprises a 
single mismatch represents a diversity of two different DNA 
sequences within the cell. If the mismatch repair system 
corrects the mismatch their will only be a diversity of one 
5 within the cell. Alternatively, if the mismatch repair system 
mediates the degradation of such a duplex DNA sequence this 
diversity will be lost. 

A polypeptide encoded by a gene involved in the mismatch 
repair system recognises a mismatch by a mechanism involving 
io binding to the mismatch. 

Accordingly, a suitable assay to test whether or not a 
filamentous fungal cell as described herein is inactivated in 
its mismatch repair system is to use a ^ v gel shift assay 1 1 or 
alternatively termed a ""gel retardation assay 11 . This is a 
is standard assay used in the art. See WO 97/05268, page 16,17 and 
25. 

The principle in such an assay is that cell extracts are 
prepared of both (a) a filamentous fungal cell wherein the 
gene, as described herein, involved in the mismatch repair 

20 system is inactivated; and (b) the corresponding filamentous 
fungal cell wherein the gene is NOT inactivated. These extracts 
are then bound/mixed to oligonucleotides containing the base- 
pair mismatched G:T; G: A; G:G; A:C, and an extrahelical TG 
dinucleotide and run on a nondenaturing gel. 

^5 If the gel shift assay demonstrates that the control 

filamentous fungal cell wherein the gene is NOT inactivated 
comprises any protein (s) which binds to any of above mentioned 
oligonucleotides and these binding protein (s) are NOT seen in 
the filamentous fungal cell wherein the gene, as described 

30 herein, involved in the mismatch repair system is inactivated 
then it is a confirmation that the mismatch repair system in 
the latter is inactivated. 

A detailed description of a suitable gel shift assay is 
provided in working example 1 herein. 

35 The sequence identity in relation to the terms 

" a DNA sequence encoding a polypeptide sequence which is 
at least 70% identical to the polypeptide sequence shown in 

positions 683-758 of SEQ ID NO 2 1 ' and 
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"" a DNA sequence encoding a polypeptide sequence which is 
at least 70% identical to the polypeptide sequence shown in 

positions 1-940 of SEP ID NO 2 " ; 

is determined as the degree of identity between two 
5 sequences indicating a derivation of the first sequence from 
the second. The identity may suitably be determined by means of 
computer programs known in the art such as GAP provided in the 
GCG program package (Program Manual for the Wisconsin Package, 
Version 8, August 1994, Genetics Computer Group, 575 Science 

10 Drive, Madison, Wisconsin, USA 53711) (Needleman, S.B. and 
Wunsch, CD., (1970), Journal of Molecular Biology, 48, 443- 
453) . Using GAP with the following settings for polypeptide 
sequence comparison: GAP creation penalty of 3.0 and GAP 
extension penalty of 0.1, the polypeptide encoded by an 

is analogous DNA sequence of the invention exhibits a degree of 
identity preferably of at least 70%, more preferably at least 
80%, more preferably at least 90%, more preferably at least 
95% , and especially at least 97% with amino acid sequence shown 
in positions 683-758 of SEQ ID NO 2, according to the first 

20 aspect of the invention; or with amino acid sequence shown in 
positions 1-940 of SEQ ID NO 2 , according to the second aspect 
of the invention. 

The term "" DNA library ' ' denotes herein a library of at 
least two different DNA sequences. For many practical purposes 

25 the library is must bigger. Accordingly, the DNA library 
preferably comprises at least 1000 different DNA sequences, 
more preferably at least 10000 different DNA sequences, and 
even more preferably at least 100000 different DNA sequences. 

The term "" placing individually different DNA sequences of 

30 interest in a filamentous fungal cell population " in relation 
to step (a)' in the process of the third aspect of the invention - 
shall herein be understood broadly in the sense that it is NOT 
identical DNA sequences of interest which are placed in the 
filamentous fungal cell population. In the present context, 

35 relating to a process for making a DNA library using a mismatch 
repair deficient cell, the term should preferably denotes a 
situation wherein a cell within the filamentous fungal cell 
population comprises at least two different DNA sequences of 
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interest which are so partially homologous that they are 
capable of hybridising/recombining to each other within the 
cell. It is within the skilled persons general knowledge to 
determine how partially homologous such sequences have to be in 
5 order to obtain said recombination within the cell. 

A practical example may be that single stranded 
oligonucleotide sequences partially homologous to chromosomal 
DNA sequence are placed within the cell or duplex DNA sequences 
comprising mismatches (e.g. comprised within a vector) are 
10 placed within the cell. See below for further description of 
such examples . 

The specific experimental way of placing these DNA 
sequences within a filamentous cell may be done according to 
any of the many suitable techniques, such as transformation 
is techniques. 

The term "" growing the population of (a) for a period of 
time allowing an individual DNA seguence of interest in the 
population to be duplicated at least once under conditions 
wherein the mismatch repair system gene has been inactivated 11 

20 according to step (b) of the third aspect of the invention 
denotes that after an individual cell has duplicated itself at 
least once then the mismatch repair system may be activated 
again without loosing the advantage of the process. The 
technical reason for this is illustrated in figure 1. In this 

25 example a duplex DNA sequence comprising a single mismatch is 
placed in filamentous cell. After the cell has been duplicated 
once under conditions wherein the mismatch repair system gene 
has been inactivated the two individually different single 
stranded DNA sequences within the duplex DNA have individually 

30 been duplicated providing two different duplex sequences, one 
in each duplicated cell, without any mismatches. Accordingly, 
since such a cell does NOT comprise duplex DNA sequences of 
interest having mismatches then there is no technical need to 
maintaining the mismatch repair system inactivated. 

35 In sections below are described preferred embodiments of 

the invention by way of examples only. 



DRAWINGS : 
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Figure 1 : 

This figure illustrates an example wherein a duplex DNA 
sequence comprising a single mismatch is placed in filamentous 
cell. After the cell has been duplicated once under conditions 
5 wherein the mismatch repair system gene has been inactivated 
the two individually different single stranded DNA sequences 
within the duplex DNA have individually been duplicated 
providing two different duplex sequences, one in each 
duplicated cell, without any mismatches. On the contrary, in a 
10 cell wherein the mismatch repair system is active, a mismatch 
within a duplex is corrected. 

Figure 2 : 

This figure shows three partial Aspergillus oryzae 

is polypeptide sequences: "msh2 1 Ao-collO/13/15 ; derived from 
cloned PCR fragments. The three partial polypeptide sequences 
are aligned with two other partial polypeptide sequences of 
known mismatch repair proteins: a human mismatch repair 
protein, msh2-human.p; and a fungal Saccharomyces cerevisiae 

20 mismatch repair protein, S.c. msh2 . The underlined sequences in 
the figure derive from the construction of the PCR fragments. 

Figure 3 : 

This figure shows an alignment of the proposed 
25 polypeptide sequence of the putative Aspergillus oryzae 

mismatch repair protein (Ao.MSH2) with the polypeptide 
sequences of three known mismatch repair proteins from human 
( msh2- human. p) , mouse (msh2-mus.p) , and yeast (S.c. msh2) . 

30 

DETAILED DESCRIPTION OF THE INVENTION 

A filamentous fungal cell, as described herein, wherein a gene, 
as described herein, involved in the mismatch repair system 
35 has been inactivated. 



Inactivation of a gene involved in the mismatch repair system: 
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The NOVEL gene, as described herein, involved in the 
mismatch repair system may be inactivated by any of the 
numerous known techniques known to the skilled person. 

An embodiment of the invention relates to a filamentous 
5 fungal cell as described herein, wherein the gene involved in 
the mismatch repair is defective. 

Numerous methods are known to the skilled person to make 
a gene defective when the DNA sequence is KNOWN. These methods 
includes deleting part of the DNA sequence of the gene; 
10 introducing frame -shift mutations by deleting or inserting 
nucleotides; introducing stop codons etc. 

A preferred embodiment of the invention relates to a 
filamentous fungal cell as described herein, wherein the gene 
involved in the mismatch repair has been inactivated 
is transitorily. 

Similarly to above, a number of methods are known to the 
skilled person for doing this, including insertion of a 
regulable promoter upstream of the gene or e.g. permanently 
deleting part of the gene on the chromosome followed by 
20 inserting a vector (e.g. a plasmid) into the cell which 
comprises the gene. The plasmid may then comprise a regulable 
promoter up -steam of the gene or it may be that the plasmid can 
be removed from the cell when the mismatch repair system shall 
be inactivated transitorily and then re-inserted into the cell 
25 when the mismatch repair system shall be re-activated. 

It is within the skilled persons general knowledge to 
choose the appropriate strategy for a specific technical 
purpose . 

A preferred way to make a filamentous fungal cell which 
30 is capable of transitorily inactivate the mismatch repair 
system as described herein is first to permanently inactive the 
mismatch repair gene described herein on the chromosome of the 
cell followed by inserting a plasmid into the cell which 
comprises the gene, wherein the plasmid is characterised by 
35 that it comprises a suitable replication initiating sequence 
and a suitable selectable marker. 

Preferably the suitable replication initiating sequence 
is AMA1 (Gems, D., et al . (1991, Gene 98:61-67). 
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A more detailed description of suitable replication 
initiating sequences and suitable selectable markers is 
provided immediately below and in working example 4 herein is 
provided an example of this strategy using a plasmid comprising 
5 AMA1 as replication initiating sequence and AmdS as selectable 
marker . 

Replication initiating sequences 

As used herein, the term "fungal replication initiating 

10 sequence" is defined as a nucleic acid sequence which is capa- 
ble of supporting autonomous replication of an extrachromosomal 
molecule, e.g., a plasmid or a DNA vector, in a fungal host 
cell, normally without structural rearrangement of the plasmid 
or integration into the host cell genome. The replication ini- 

15 tiating sequence may be of any origin as long as it is capable 
of mediating replication initiating activity in a fungal cell. 
Preferably, the replication initiating sequence is obtained 
from a filamentous fungal cell, more preferably a strain of As- 
pergillus, Fusarium or Alternaria, and even more preferably, a 

20 strain of A. nidulans, A. oryzae, A. niger, F. oxysporum or Al- 
ternaria al tenata . 

A replication initiating sequence may be identified by 
methods well-known in the art. For instance, the sequence may 
be identified among genomic fragments derived from the organism 

25 in question as a sequence capable of sustaining autonomous rep- 
lication in yeast, (Ballance and Turner, Gene, 36 (1985), 321- 
331), an indication of a capability of autonomous replication 
in filamentous fungal cells. The replication initiating activ- 
ity in fungi of a given sequence may also be determined by 

30 transforming fungi with contemplated plasmid replicators and 
selecting for colonies having an irregular morphology, indicat- 
ing loss of a sectorial plasmid which in turn would lead to 
lack of growth on selective medium when selecting for a gene 
found on the plasmid (Gems et al, Gene, 98 (1991) 61-67) . AMA1 

35 was isolated in this way. An alternative way to isolate a rep- 
lication initiating sequence is to isolate natural occurring 
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plasmids (e.g. as disclosed by Tsuge et al., Genetics 146 
(1997) 111-120 for Alternaria aternata) . 

Examples of replication initiating sequences include, but 
are not limited to, the ANSI and AMA1 sequences of Aspergillus 
5 nidulans, e.g., as described, respectively, by Cullen, D., et 
al. (1987, Nucleic Acids Res. 15:9163-9175) and Gems, D., et 
al. (1991, Gene 98:61-67). 

The term "replication initiating activity" is used herein 
in its conventional meaning, .i.e. to indicate that the sequence 
10 is capable of supporting autonomous replication of an ex- 
trachromosomal molecule, such as a plasmid or a DNA vector in a 
fungal cell . 

The term "without structural rearrangement of the plasmid" 
is used herein to mean that no part of the plasmid is deleted 
is or inserted into another part of the plasmid, nor is any host 
genomic DNA inserted into the plasmid. 

Filamentous fungal selective marker 

The term "selective pressure" is defined herein as culturing 

20 a filamentous fungal cell, containing a DNA vector containing a 
fungal selective marker gene operably linked to a polynucleo- 
tide sequence of interest, in the presence of an effective 
amount or the absence of an appropriate selective agent. The 
effective amount of the selective agent is defined herein as an 

25 amount sufficient for allowing the selection of cells contain- 
ing the selection marker from cells which do not contain the 
selection marker. 

In a preferred embodiment, the fungal selective marker is 
selected from the group of genes which encodes a product capa- 

30 ble of providing resistance to biocide or viral toxicity, re- 
sistance to heavy metal toxicity, or prototrophy to auxotrophs. 

In a more preferred embodiment, the prototrophy is obtained 
from an enzyme selected from the group of metabolic pathways 
consisting of nucleotide synthesis, cofactor synthesis, amino 

35 acid synthesis, acetamide metabolism, proline metabolism, sul- 
fate metabolism, and nitrate metabolism. 
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In an even more preferred embodiment, in the methods of the 
present invention the fungal selective marker is a gene se- 
lected from the group consisting of argB (ornithine carbamoyl- 
transferase) , amdS (acetamidase) , bar (phosphinothricin acetyl- 
5 transferase) , hemA ( 5-aminolevulinate synthase) , hemB (porpho- 
bilinogen synthase) , hygB (hygromycin phosphotransferase) , niaD 
(nitrate reductase) , prn (proline permease) , pyrG (orotidine- 
5' -phosphate decarboxylase), pyroA, riboB, sC (sulfate adenyl- 
transferase) , and trpC (anthranilate synthase) . 

10 The fungal cell is cultivated in a suitable medium and under 
suitable conditions for screening or selecting for transfor- 
mants harbouring the variant polynucleotide sequence of inter- 
est having or encoding the desired characteristic. The cultiva- 
tion may be performed in accordance with methods well-known in 

is the art for screening of polynucleotide variant libraries. 

The filamentous fungal cell 

The filamentous fungal cell as described herein includes 
all filamentous forms of the subdivision Eumycota and Oomycota. 

20 The filamentous fungi are characterised by a mycelial wall 
composed of chitin, cellulose, glucan, chitosan, mannan, and 
other complex polysaccharides. Vegetative growth is by hyphal 
elongation and carbon catabolism is obligately aerobic. In 
contrast, vegetative growth by yeasts such as Saccharomyces 

25 cerevisiae is by budding of a unicellular thallus and carbon 
catabolism may be fermentative. In a preferred embodiment, the 
filamentous fungal cell is a cell of a species of, but is not 
limited to, Acremonium, Aspergillus, Fusarium, Humicola, Mucor, 
Myceliophthora, Neurospora, Penicillium, Scytalidium, 

30 Thielavia, Tolypocladium, and Trichoderma . 

Examples of filamentous fungal cells of use in the present 
invention include an Aspergillus cell, an Acremonium cell, a 
Fusarium cell, a Humicola cell, a Mucor cell, a Myceliophthora 
cell, a Neurospora cell, a Penicillium cell, a Thielavia cell, 

35 a Tolypocladium cell, and a Trichoderma cell. 
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More specifically, the filamentous fungal cell is an Asper- 
gillus awamori, Aspergillus foetidus, Aspergillus japonicus, 
Aspergillus nidulans, Aspergillus niger, or Aspergillus oryzae 
cell; 

5 a Fusarium bactridioides, Fusarium cerealis, Fusarium 
crookwellense, Fusarium culmorum, Fusarium graminearum, 
Fusarium graminum, Fusarium heterosporum, Fusarium negundi, 
Fusarium oxysporum, Fusarium reticulatum, Fusarium roseum, 
Fusarium sambucinum, Fusarium sarcochroum, Fusarium 

10 sporotricioides , Fusarium sulphureum, Fusarium torulosum, 
Fusarium trichothecioides, Fusarium venenatum cell or a 
Fusarium venenatum cell (Nirenberg sp. nov; a Humicola 
insolens cell or a Humicola lanuginosa cell; a Mucor miehei 
cell; a Myceliophthora thermophila cell; a Neurospora crassa 

15 cell; a Penicillium purpurogenum cell; a Thielavia terrestris 
cell; or a Trichoderma harzianum, Trichoderma koningii, 
Trichoderma longibrachiatum r Trichoderma reesei, or Trichoderma 
viride cell. 

20 A process for preparing a filamentous fungal cell population 
comprising a DNA library according to the third aspect of the 
invention; 

Placing individually different DNA sequences of interest in a 
25 filamentous fungal cell population according to step (a) of the 

process of the third aspect of the invention: 

As state above, the specific experimental way of placing 

these DNA sequences within a filamentous cell may be done 

according to any of the many suitable techniques, such as 
30 transformation techniques. See the general fungal textbook 

""Fungal Genetic 1 » (1996, ISBN 0-8247-9544-X) for a further 

description of such standard techniques. 

A practical example may be that single stranded 

oligonucleotide sequences partially homologous to chromosomal 
35 DNA sequence are placed within the cell. See Calissano et al. 
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(Fungal genetic newsletter 43:15-16 (1995) for further 
description of this. 

Another example may be that duplex DNA sequences 
comprising mismatches (e.g. comprised within a vector as shown 
in figure 1) are placed within the cell. 

In an preferred embodiment the different DNA sequences of 
interest is comprised in a plasmid wherein the plasmid is 
characterised by that it comprises a suitable replication 
initiating sequence and a suitable selectable marker as 
described above . 

Preferably the suitable replication initiating sequence 
is AMA1 (Gems, D., et al. (1991, Gene 98:61-67). 

Growing the population of step (a) for a period of time 
allowing an individual DNA sequence of interest in the 
population to be duplicated at least once under conditions 
wherein the mismatch repair system gene, as describe herein has 
been inactivated, according to step (b) of the third aspect of 
the invention. 

Growing of the population may be done in any of the 
numerous suitable known media for growing filamentous fungal 
cells. It is within the skilled persons general knowledge to 
choose such a suitable media. 

As explained above an individual cell in the population 
must be allowed to be duplicated at least once under conditions 
wherein the mismatch repair system gene, as described herein 
has been inactivated. 

Said cells may of course be allowed to be duplicated more 
than once under conditions wherein the mismatch repair system 
gene has been inactivated. 

Since inactivation of the mismatch repair system normally 
will cause accumulation of mutations on the chromosomal DNA 
within the cell and thereby maybe make lethal mutations to the 
cell the actual preferred number of duplication cycles as 
described above will depend on how fast such potential lethal 
mutations arise. 

It is within the skilled persons general knowledge to 
determine how many of duplication cycles it preferred. 
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Due to these potential lethal mutations it is preferred 
that the mismatch repair system under step (b) has been 
inactivated transitorily. 

After suitable cycles of duplication according to step 
5 (b) of the third aspect the transitorily inactivated mismatch 
repair system the mismatch repair system is then re-activated 
in order to avoid these lethal mutations in the filamentous 
fungal cell as such. The strategy for this transitorily 
inactivation may be any of the strategies described above. 
10 Another strategy to limit introduction of mutations on 

the chromosome, is to transitorily stop the chromosomal 
replication while replicating the extra-chromosomal element 
under mismatch repair deficient conditions. This can be 
achieved by introducing mutations in elements being solely 
15 necessary for the chromosomal replication. 

A preferred strategy is to use a filamentous fungal cell 
wherein the gene involved in the mismatch repair system as 
described herein is permanently inactivated on the chromosome 
of the cell followed by inserting a plasmid into the cell which 
20 comprises the gene, wherein the plasmid is characterised by 
that it comprises a suitable replication initiating sequence 
and a suitable selectable marker. See above for a further 
explanation of this strategy. 

Preferably the suitable replication initiating sequence 
25 is AMA1 (Gems, D., et al . (1991, Gene 98:61-67). 

A further embodiment relates to the process of the third 
aspect of the invention, wherein the mismatch repair system 
under step (b) is defective. 

In a further embodiment the invention relates to a 
30 process as described herein, wherein, under step (b) of the 
third aspect of the invention, there is an in vivo intergenic 

recombination of partially homologous DNA sequences of 
interest . 

Since the overall concept of the present invention is to 
35 provide a process involving inactivation of the mismatch system 
it is of course preferably that said partially homologous DNA 
sequences are capable of in vivo forming duplex DNA sequences 
comprising mismatches. 
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A process for production of a polypeptide of interest 
comprising the steps of the third aspect of the inv ntion and 
wherein the DNA sequences of interest encode a polypeptide of 
5 interest, according to the fourth aspect of the invention; 

Selecting from the resultant population of filamentous funeral 
cells of step (b) of the third aspect a desired polypeptide of 
interest, according to step (c) of the fourth aspect. 

10 The desired polypeptide of interest may be any 

polypeptide comprising an desired technical feature, such as 
improved stability; a desired specific activity; a desired pH 
optimum; an improved wash performance in a detergent; etc. 

The specific strategy for selecting this desired 

15 polypeptide of interest may be any of the numerous selecting 
strategies known to the skilled person, such as plate screening 
assays, micro- titer plate based assays, etc. 

An embodiment of the invention relates to a process of 
20 the fourth aspect of the invention, which further comprises 
following steps: 

(d) an optionally step comprising modifying the amino 
acid sequence of the desired selected polypeptide of 
interest according to a particularly further specific 

2 5 need; 

(e) placing the DNA sequence encoding the polypeptide of 
interest of step (c) of the fourth aspect or the 
modified polypeptide of interest of step (d) into a 
filamentous fungal cell which is suitable for large 

30 scale production of the polypeptide of interest ; 

(f) cultivating the filamentous fungal cell of step (e) 
in a fermentor of at least 10000 m3 under conditions 
permitting expression of the polypeptide of interest; 
and 

35 (g) isolating the polypeptide of interest. 

This embodiment relates to an industrial very relevant 
process, wherein the selected polypeptide of interest is 
produced in large scale . 
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The optionally step (d) relates to a situation wherein 
e.g. the desired polypeptide of interest is selected in order 
to e.g. identify a polypeptide with improved wash performance 
in a detergent according to step (c) of the third aspect of the 
5 invention. This polypeptide while having improved wash 
performance in a detergent may not be sufficiently stable for a 
commercial product. Accordingly, it may be required to make 
some further amino acid substitutions in this selected 
polypeptide, such as e.g. suitable Proline substitutions on 
10 order to obtain sufficient "stability to commercialising this 
polypeptide . 

A further embodiment relates to a process of the 
embodiment immediately above, wherein the filamentous fungal 
cell which is suitable for large-scale production of the 
is polypeptide of interest of step (e) said embodiment is another 

filamentous fungal cell as compared to the filamentous fungal 
cell of step (a) of the third aspect of the invention. 

This embodiment relates to a situation wherein the 
filamentous fungal cell used to select the polypeptide of 
20 interest is different from the one which is used for large 

scale production. 

A further embodiment relates to a process as described 

herein, wherein the polypeptide of interest is a polypeptide 

derived from a filamentous fungal cell. 
25 The term derived from a filamentous fungal cell should be 

understood in the sense, that the information in the amino acid 

sequence of the polypeptide of interest is derived from a 

polypeptide obtained from a filamentous fungal cell. 

Consequently, it may be a variant of a wild-type 
30 filamentous fungal polypeptide and/or may be a polypeptide 

which is a result of a recombination/shuffling of two or more 

different filamentous fungal polypeptides. 

In an even further embodiment the invention relates to a 

process as described herein, wherein the polypeptide of inter- 
35 est is an enzyme, such as an amylase, a protease, a cellulase, 

a lipase, a xylanase; a phospholipase . 
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Materials 

Chemicals used as buffers and substrates were commercial 
products of at least reagent grade. 

EXAMPLE 1: 

A gel shift assay suitable for determining if a filamentous 
fungal cell as described herein is inactivated in the mismatch 
repair system: 

The principle in this gel shift assay is that cell 
extracts are prepared of both (a) a filamentous fungal cell 
wherein the gene, as described herein, involved in the mismatch 
repair system is inactivated; and (b) the corresponding 
filamentous fungal cell wherein the gene is NOT inactivated. 
These extracts are then bound/mixed to oligonucleotides 
containing the base-pair mismatched G;T; G: A; G:G; A:C, and an 
extrahelical TG dinucleotide and run on a nondenaturing gel. 

If the gel shift assay demonstrates that the control 
filamentous fungal cell wherein the gene is NOT inactivated 
comprises any protein (s) which binds to any of above mentioned 
oligonucleotides and these binding protein (s) is NOT seen in 
the filamentous fungal cell wherein the gene, as described 
herein, involved in the mismatch repair system is inactivated 
then it is a confirmation that the mismatch repair system in 
the latter is inactivated. 
Experimental procedure : 

Preparation of cell extracts are performed as described 
in Nagata et al. (Mol. Gen Genet (1993) 237:251-260; See 
Materials and Methods) . 

Annealing of oligonucleotides, binding of cell extracts 
to duplex oligonucleotides containing mismatched, and 
nondenaturing polyacrylamide gelelectrophoresis are performed 
essentially as described (Stephenson and Karran; Selective 
binding to DNA base pair mismatches by proteins from human 
cells; J. Biol. Chem. 264:2177-21182 (1989)). 

However; gelelectrophoresis is performed in TAE buffer 
rather than in TBE buffer. To obtain duplex oligonucleotides, 
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the oligonucleotide U is radiolabelled and annealed with any of 
the unlabelled oligonucleotides L-G.T, L-G.A, L-G.C, L-A.C, L- 
T.G., or L-HOM. Oligonucleotide sequences are derived from 
Aquilina et al . Proc. Natl. Acad. Sci. USA 91:8905-8909 (1994). 

5 

Ul 5 ' -GGGAAGCTGCCAGGCCCCAGTGTCAGCCTCCTATGCTC-3 ' (SEQ ID NO 

3) ; 

L-G.T: 5 ' -GAGCATAGGAGGCTGACATTGGGGCCTGGCAGCTTCCC-3 ' (SEQ ID NO 

4) (resulting in a G.T mismatch) ; 

10 L-G.A. ; 5 " -GAGCATAGGAGGCTGACAATGGGGCCTGGCAGCTTCCC-3 " (SEQ ID NO 

5) (resulting in a G.A mismatch) ; 

L-G.G. ; 5 ' -GAGCATAGGAGGCTGACAGTGGGGCCTGGCAGCTTCCC-3 ' (SEQ ID NO 

6) (resulting in a G.G mismatch) ; 

L-A.C. ; 5 ' -GAGCATAGGAGGCTGACACCGGGGCCTGGdAGCTTCCC-3 " (SEQ ID NO 
15 7) (resulting in a A.C mismatch); 

L-TG: 5 ' -GAGCATAGGAGGCTGACACTGTGGGGCCTGGCAGCTTCCC- 3 ' (SEQ ID NO 

8) (resulting in an extrahelical TG dinucleotide) ; 
L-HOM: 5 ' -GAGCATAGGAGGCTGACACCGGGGCCTGGCAGCTTCCC-3 ' (SEQ ID NO 

9) (resulting in a homoduplex) . 

20 

In all assays, a twofold excess of unlabelled homoduplex 
competitor oligonucleotide is included. 

EXAMPLE 2: 

25 Cloning of a gene involved in the mismatch repair system of an 
Aspergillus oryzae cell. 

The gene cloned as described in this example is shown in 
SEQ ID NO 1 (DNA sequence) and SEQ ID NO 2 (the translated 
amino acid sequence) . 
30 Several sequences of mismatch repair proteins from 

various organisms are known, only three of these have been 
utilized in the following: 3. cerevisiae (M84170) , H. sapiens 
(L47580) and mouse (U21011) . 

The numbers indicated are reference numbers from the 
35 public available GenBank database. 
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Based on the C- terminal homology between known mismatch 
repair proteins, a set of degenerate primers were designed, 
aiming at amplification of a partial sequence of the 
Aspergillus oryzae homolog: 

5 

Pr 117858 (SEQ ID NO 10) : P-GGCNCARATHGGNTGYTTYGTNCC 
Pr 117859 (SEQ ID NO 11) : P-GCCCANGCNARNCCRAANCC 

With chromosomal DNA from A, oryzae strain JaL142 (WO 
10 96/29391) as template, and above primers the following 50/zl PWO 
polymerase based PCR reaction was performed at eight MgS0 4 
concentrations (0.5 mM to 4.0 mM, as recommended by the 
manufacturer; Boehringer M.) . 1 mM MgS0 4 was found to be optimal 
and gave a discrete band of appr. 230 bp as would be expected 
15 if no introns were embodied in the sequence. 

PCR-cycle profile: [96°C; 2 min - 30 cycles of (94°C; 15s - 
50°C; 15s - 72°C; 30s) - 72°C,7 min - 4°C; hold], 

20 The 23 0bp PCR fragment was blunt end ligated into filled 

in BamHl site of pUC19. pUC19 was BamHl cleaved in presence of 
calf intestine alkaline phosphatase, followed by filling in 
the sticky ends by klenow polymerase and dNTP. Three 
individual plasmids harbouring the insert were isolated from 

25 E.coli XL1 transf ormants of above ligation, and sequenced. 
Alignment of polypeptides derived by translation of the cloned 
PCR fragments, revealed a strong homolgy to known mismatch 
repair protein sequences (see Figure 2) . 

The underlined sequences of Figure 2 are sequences 

30 derived from the consensus PCR primers described above. 

The three Aspergillus sequences of Figure 2 are equal to 

the sequence shown in SEQ ID NO 2 from positions 683-758, 
except from position 685 which in the final cloned sequence is 
a Thr '(T) in sted of an lie (I) as indicated above. This is due 
35 to the sequence in above mentioned consensus primers. 

The alignment shown in Figure 2 clearly demonstrates that 
the cloned fragment originates from an A . oryzae homologue of a 
mismatch repair protein. 
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In order to clone the entire gene, a radiolabeled probe 
of the cloned fragment was generated by PCR, using 0.5 mg 
pUC19 ' msh2 ' -13 (see above) as template in a 100 ml reaction 
with Taq polymerase, 30 pmol pUC forward and reverse primers 
5 and o.2 mM of dG- , dC-, dTTP and 0.2mM dATP + 32 P-dATP. The 
generated radiolabeled probe was liberated from pUC19 sequences 
by EcoRl- Hind3 digestion followed by gel purification of the 
resulting 293 bp fragment. 

The probe was hybridized to a membrane gridded cosmid 
10 library of genomic DNA from A. oryzae strain A1560 (the father 

of JaL142) (W096/29391) . A positive clone was identified on the 
filter when analyzed in a phosphoimager, and the clone was 
identified as ^31A2. 

The X31A2 cosmid DNA was propagated and used for southern 
is analysis, using the same radiolabeled primer as above. An 
approximately 9 Kb Pst fragment, split by BstX (previously 
found in the cloned PCR fragment) into 5.8 and 3.2 kb fragments 
both lightening up with the probe, was identified and cloned 
into Pst cut pUC19, giving a plasmid named pUC19msh2P. The 
20 insert was sequentially sequenced, starting with primers 
pointing out from the previously determined sequence, followed 
by primers based on the sequences determined in the last run: 
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GCTCGAAACATCCAACATCC 

GCTGTGAATCACTTGCACC 

CTTCATAAACTGCGACAAATCATGC 

GGAGGAGCATCTTCGC 

GGAACTTGAAGACTTTACTTCATCC 

CCAGAAACTCGCTAACACC 

GTGCTTTGCGGACGC 

CAGGACAGTAGGACGC 

CGAGCGATGAACTCTGC 

GCGTTGGTGGATTATCC 

CGTTGCATCTATCATATACC 

GGTATATGATAGATGCAACGC 



The 3 825 bp sequence hereby determined (SEQ ID NO 1) was 
translated in the frame previously determined in the PCR 
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fragment. The resulting protein (SEQ ID NO 2) called Ao . MSH2 
was aligned to the protein sequences of known mismatch repair 
proteins in Figure 3 . From the alignment in shown in Figure 3 
the cloned and sequenced DNA clearly encompasses the coding 
5 sequence for a homolog of yeast, man and mouse mismatch repair 
proteins, with one intron in the N- terminal part. The position 
of the intron was deduced by the standard splice rules, and 
constitutes the only possibility. 

10 EXAMPLE 3: 

Disruption of the gene cloned in example 1 on the chromosome of 

an Aspergillus oryzae cell: 

For the disruption experiment the msh2 CDS was deleted 
from pUC19msh2P (see example 2) by PCR, introducing a Notl site 
is instead. This was done by the primers: 

137208 (SEQ ID NO 24): 5' P-CCGCGTCTCCAACAAGATGAATGG 
137207 (SEQ ID NO 25): 5' P - CCGCTTTCTCGGGGTCATAGC 

20 In a Pwo polymerase based PCR reaction with 2,5 mM MgS0 4 

and 50 pg pUC19msh2P (conditions according to the 
manufacturer) : 

PCR-cycle profile: [96°C; 2min - 4 cycles of (94°C; 30s - 
25 52°C;30s - 72°C; 3min) - 25 cycles of (94°C; 30s - 59°C; 30s - 
72°C; 3min) - 72°C; lOmin] 

The resulting PCR product of appr. 8.9 Kb was isolated, 
ligated into pUC19, and transformed into E. coli XL1. From the 

30 resulting transf ormants pMsh2A was isolated, and the 
correctness of the new junction and its surroundings verified 
by sequencing [primer 138149 (SEQ ID NO 26) : 
CCTTTCCACTTTAATCCTAAGC] . (Xll/pMsh2A: Lac3073). 

In this construct it the A. oryzae pyrG (WO 96/29391) is 

35 inserted into the Notl site. 

By using this construct the chromosomal gene is deleted 
in an Aspergillus oryzae cell according to standard techniques 
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known in the art for crossing in such a deleted gene on the 

chromosome by homologous recombination (Miller, B.L., et al . , 
1985 Mol. and Cell. Biol. 5:1714-1721). 

5 EXAMPLE 4: 

Construction of a plasmid comprising the mismatch repair gene 
shown in SEQ ID NO 1, the AMA1 replication initiating sequence; 
and the AmdS selectable marker; 

The plasmid constructed as described below is highly 

10 suitable for making a filamentous fungal cell wherein the 
mismatch repair system may be transitorily inactivated, wherein 
this plasmid may be inserted into a mismatch disrupted strain 
of example 3 when the mismatch repair system shall be activated 
and deleted from the strain when the mismatch repair system 

is shall be inactivated. 

Disruption of the mismatch repair gene may cause the 
accumulation of new chromosomal mutations, thus such a strain 
might be genetically unstable. Consequently it was decided to 
perform the chromosomal disruption in a strain where mismatch 

20 repair gene was expressed from an extra chromosomal element 
readily lost when the Amismatch repair phenotype was wanted. 

The extra-chromosomal element was here a plasmid 
comprising AMA1 as replication initiating sequence and AmdS as 
selectable marker. 

25 For this purpose the mismatch repair gene (SEQ ID NO 1 ) 

was cloned into an autonomously replicating construct 
harbouring one AMA1 repeat . 

From pMT1505 (See example 5 below for description of 
PMT1505) the following fragments were isolated and ligated 

30 together: 

5.16 kb Not I- [Hind3] * + 3.515 kb [Sal]* -BamHl + 757bp 
BamHl- Not I 

t ] * indicates that the site has been filled in by 
3 5 Klenow-polymerase and dNTP 

From this ligation reaction pMT1505DHS was isolated (LaC 
3212) , and the mismatch repair expression cassette was 
introduced as a BamHl - Muni fragment in the corresponding 
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sites in pMT1505DHS, resulting in the plasmid pAma-msh2 (LaC 
3216) . 

Aspergillus oryzae JaL250 (see example 5) was transformed 

AmdS* with pAma-msh2 / and the transf ormants cheked for the 
5 ability to lose the amdS character when unselected (50% of the 
transf ormants) , indicating the maintanance of this plasmid as 
extra chromosomal. (LaC3244 keep on acetamide + uridine). 



EXAMPLE 5: 

10 Construction of plasmid pMT1505 used in example 4: 
Flasmids 

pMT1505: constructed as described below in Example 5 

pHelpl: contains the pyrG gene from A. oryzae as a selective 

marker and the AMA1 sequences which enable autonomous 
is replication in A. nidulans as described by Gems, D., 

et al. (1991. Gene 98: 61-67) 
pToC68: as described in EP 0 531 372 (Novo Nordisk A/S) 



Strains 

20 JaL250: a derivative of Aspergillus oryzae A1560 in which the 

pyrG gene has been inactivated, as described in WO 
98/01470 

DH5a: an E. coli host cell purchased from GIBCO BRL (Life 

Technologies, Inc., Rockville MD) 

25 

pMT14 66 was constructed by inserting an Sphl/Narl fragment 
from pHelpl into pToC68. pMT1489 was constructed by digesting 
pMT1466 with SphI and StuI, then religating. pMT1500 was con- 
structed by digesting pMT1489 with Aatll and Narl and ligating - 

30 a linker. pMT1504 was constructed by digesting pMT1500 with 
Nhel aqd religating. 

PMT15Q5 was constructed by inserting a 2.7 kb Xbal fragment 
containing the amdS encoding gene from A. nidulans genomic DNA 
(Corrick, CM., et al . 1987, Gene 53:63-71) into pMT1504 which 

35 had been cut with Nhel. 
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EXAMPLE 6: 

Deletion of part of the mshll gene on the chromosome 

The plasmid p418MsHII (from lac3159) is cut with Sail and 
Xhol and treated with calf-intestinal phosphatase. In this man- 
ner part of the msHII gene is cut out. The large band (6400 bp) 
5 containing the vector and most of the msHII gene is isolated 
from a 1% agarose gel. 

The plasmid pJal554 was constructed by ligating a 
Spel/SspBI cut fragment (5330 bp) from pS02 with a Asp718/Nhel 
cut fragment (316 bp) from pS02 . Plasmid pJal554 is cut with 
Sail and a 2350 bp band which contains the pyrG gene is 
isolated on a 1% agarose gel. The 2350bp band with pyrG is 
ligated with the cut p418MsHII plasmid and transformed into £. 
coli. The right E. coli transformant is identified by 
restriction analysis and a plasmid preparation is made from 
this transformant . 

The plasmid thus prepared is cut with £coRI in order to 
linearize the plasmid before it is transformed into for example 
Aspergillus oryzae Jal250. Transf ormants are selected on mini- 
10 mal plates . 

A transformant where a double crossover event has taken 
place is identified by making an Aspergillus chromosomal DNA 
prep followed by a PCR screen for full-length mshll gene using 
appropriate primers. A Southern blot is made using chromosomal 
is DNA which is randomly fragmented with appropriate enzymes as 
well as appropriate probes for the deleted msHII fragment 
(which is not there any longer) as well as a positive control 
probe. 

In order to determine an increased mutation frequency in 
20 the strain with an inactivated msHII gene, a screen for muta- 
tions in the niaD gene is made. This is done by growing the 
parent strain Aspergillus Jal250 and the msHII inactivated 
strain on plates . 

A spore-suspension is made and aliquotes of spores are 
25 plated on chlorate-containing plate as described by Unkles et 
al. (S. E . Unkles, E.C.Campbell. Y.M.J.T. de Ruite Jacobs, M. 
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Broekhuisen, J. A. Macro, D.Carrez, R. Contreras, C. A. M.J. J. van 
den Hondel J.R. Kinghorn. The development of a homologous 
transformation system for Aspergillus oryzae based on nitrate 
assimilation pathway: A convenient and general selection system 
5 for filamentous fungal transformation. Molecular and general 
genetics V:218 p 99-104 (1989)). 

The strain with no expression of the MsHII protein will 
have a higher rate of niaD mutations (more chlorate resistant 
clones) , than the control strain. 

10 

EXAMPLE 7: 

Making In Vivo antisense msHII RNA to inhibit translation of 
msffll mRNA using the TAKA promoter to drive transcription of 
the Anti-sense mRNA 

is Anti-sense RNA expression is a well known way to down 

regulate expression of any gene in vivo (The design of An- 
tisense RNA , Georg Sczakiel , Antisense and nucleic acid drug 
development V. 7 P. 439-444 (1997)) 

A per fragment is made using the oligo's: 

20 000120j2 (SEQ ID NO 27): TCTGCGAATCGCTTGGATCCCGAACGCGACAACAC, 
000120] 4 (SEQ ID NO 28): GAGCTCAGATCTCTTAGGTTCTGGACGAGAAGA, 
and pUC19msh2P as template. This PCR fragment contains the 5 
v end of the msHII gene including the presumed part of 5' msHII 
mRNA. Another PCR fragment is made using the oligo's: 

25 000120j3 (SEQ ID NO 29): GTTGTCGCGTTCGGGATCCAAGCGATTCGCAGAAG, 
1298-TAKA (SEQ ID NO 30) : GCAAGCGCGCGCAATACATGGTGTTTTGATCAT, 
and pENI1298 as template (PCT DK99/00552) . 

Both PCR reactions are done using PWO polymerase according to 
the manufacturers manual (Boehringer-Mannheim) . 

30 The PCR fragments are purified using the Qiagen PCR puri- 

fication kit (Qiagen) . The two PCR fragments are mixed and a 
third PCR reaction is done with primer 1298-TAKA and 000120j4. 
In this manner the two PCR fragments are assembled. 

The assembled PCR fragment is cut with BssHII and Bglll, 

35 and purified from a 1.5 % agarose gel and ligated with pENI1298 
which was cut with BssHII and Bgl II (purified from 1 % agarose 
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gel) . The ligation mixture is transformed into £. coli. A DNA- 
prep is made of each of the resulting E. coli transf ormants . 
The assembled PCR fragment is sequenced to confirm that no un- 
wanted mutations are introduced during the procedure. The cor- 
5 rect construct contains the TAKA promoter, which drives the 
transcription of the msHII anti-sense mRNA. 

The resulting plasmid is transformed into for example As- 
pergillus oryzae Jal250 along with pENI1298 as control, and 
transf ormant are selected on minimal plates. The resulting 

10 transf ormants are isolated on minimal plates and incubated at 
37°C until they sporulate. 

In order to determine an increased mutation frequency in 
the strain, where the translation of the msHII mRNA is impeded 
due to msHII anti-sense RNA expression, a screen for mutations 

is in the niaD gene is made. A spore-suspension is prepared of the 
control transf ormants (pENil298) and of the jus HI I " Anti-sense 
RNA transf ormants, and equal amounts of spores are plated on to 
chlorate-containing plate as described by Unkles et al. (S. 
E.Unkles, E.C.Campbell. Y.M.J.T. de Ruite Jacobs, M. Broekhui- 

20 sen, J. A. Macro, D.Carrez, R. Contreras, C.A.M.J.J. van den 
Hondel J.R. Kinghorn. The development of a homologous transfor- 
mation system for Aspergillus oryzae based on nitrate assimila- 
tion pathway: A convenient and general selection system for 
filamentous fungal transformation. Molecular and general genet- 

25 ics V:218 p 99-104 (1989)). 

The strain with no or low expression of the MsHII protein 
will have a higher rate of niaD mutations (more chlorate resis- 
tant clones) , than the control strain. 
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CLAIMS 

1. A filamentous fungal cell, wherein a gene involved in the 
mismatch repair system has been inactivated and in which the 
gene involved in the mismatch repair system comprises: 

(a) a DNA sequence encoding the polypeptide sequence 
shown in positions 683-758 of SEQ ID NO 2; or 

(b) a DNA sequence encoding a polypeptide sequence which 
is at least 70% identical to the polypeptide sequence 
shown in positions 683-758 of SEQ ID NO 2 . 

2. A filamentous fungal cell, wherein a gene involved in the 
mismatch repair system has been inactivated and in which the 
gene involved in the mismatch repair system comprises: 

(a) a DNA sequence encoding the polypeptide sequence 
shown in positions 1-940 of SEQ ID NO 2; or 

(b) a DNA sequence encoding a polypeptide sequence which 
is at least 70% identical to the polypeptide sequence 
shown in positions 1-940 of SEQ ID NO 2. 

3. The filamentous fungal cell of claim 1 or 2, wherein the 
gene involved in the mismatch repair is defective. 

4. The filamentous fungal cell of claim 1 or 2, wherein the 
gene involved in the mismatch repair has been inactivated 
transitorily. 

5. The filamentous fungal cell of any of the preceding claims, 
wherein the filamentous fungal cell is a strain of Fusarium or 
more preferably Aspergillus. 

6. A process for preparing a filamentous fungal cell population 
wherein individual cells in the population comprise 
individually different DNA sequences of interest representing a 
DNA library of interest comprising following steps: 

(a) placing individually different DNA sequences of 
interest in a filamentous fungal cell population 
comprising a filamentous fungal cell of claim 1 or 2; 
and 
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(b) growing the population of (a) for a period of time 
allowing an individual DNA sequence of interest in the 
population to be duplicated at least once under 
conditions wherein the mismatch repair system gene of 
claim 1 or 2 has been inactivated. 

7. The process of claim 6, wherein the mismatch repair system 
under step (b) is defective. 

8. The process of claim 6, wherein the mismatch repair system 
under step (b) has been inactivated transitorily. 

9. The process of any of claims 6-8, wherein the filamentous 
fungal cell is a strain of Fusarium or more preferably 
Aspergillus . 

10. The process of any of claims 6-9, wherein, under step (b) 
of claim 6, there is an in vivo intergenic recombination of 
partially homologous DNA sequences of interest. 

11. A process for production of a polypeptide of interest 
comprising the steps of claim 6 and wherein the DNA sequences 
of interest encode a polypeptide of interest and which further 
comprises following step: 

(c) selecting from the resultant population of 
filamentous fungal cells of step (b) of claim 6 a 
desired polypeptide of interest. 

12. The process of claim 11, which further comprises following 
steps : 

(d) an optionally step comprising modifying the amino 
acid sequence of the desired selected polypeptide of 
interest according to a particularly further specific 
need; 

(e) placing the DNA sequence encoding the polypeptide of 
interest of step (c) of claim 11 or the modified 
polypeptide of interest of step (d) into a filamentous 
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fungal cell which is suitable for large scale 
production of the polypeptide of interest; 

(f) cultivating the filamentous fungal cell of step (e) 
in a fermentor of at least 10000 m3 under conditions 

5 permitting expression of the polypeptide of interest; 

and 

(g) isolating the polypeptide of interest. 

13. The process of claim 12, wherein the filamentous fungal 
10 cell which is suitable for large scale production of the 
polypeptide of interest of step (e) of claim 12 is another 
filamentous fungal cell as compared to the filamentous fungal 
cell of step (a) of claim 6. 

15 14. The process of any of claims 11-13, wherein the polypeptide 
of interest is a polypeptide derived from a filamentous fungal 
cell . 

15. The process of any of claims 11-14, wherein the polypeptide 
20 of interest is an enzyme, such as an amylase, a protease, a 
cellulase, a lipase, a xylanase; a phospholipase . 
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<11C> Novo Norilsk A./S 

<120> A filamer.- rus fungal mismatch repair defective cell 

<13C> Mismatch repair deficient fila. fungi 

<14C> 
<141> 

<160> 30 

<170> Patent In Ver. 2.1 

<210> 1 
<211> 3823 
<212> DNA 

<213> Aspegiliu5 oryzae 

<220> 
<221> CDS 

<222> (700) . . {72: ; 
<220> 

<221> intron 
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<220> 
<221> CDS 
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gaa ctt aag gtaagtaaac aagacaaccg gtctctcgaa cattcaataa 763 
Glu Leu Lys 

caattaaccc tgtcrac gtt gac gac gaa gtc ggc ttc att cgt ttt tac 813 

Val Asp Asp Glu Val Gly Phe lie Arg Phe Tyr 
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cgt tec etc gca gca aat age aac gat gaa act att cgc gtt ttc gac 861 

Arg Ser Leu Ala Ala Asn Ser Asn Asp Glu Thr lie Arg Val Phe Asp 
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cgc ggt gac tgg tac tct gec cat ggc gec aaa gca gag ttc ate get 909 

Arg Giy Asp Trp Tyr Ser Ala His Gly Ala Lys Ala Glu Phe He Ala 
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cgc act gtg tac sac acc ace tct ata etc cgc aat eta ggt cgc age -957 

Arg Thr Val Tyr Lys Thr Thr Ser He Leu Arg Asn Leu Gly Arg Ser 
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Trp Gly Ser Val Gly Thr Gly Lys Gly His Trp Lys Leu Val Lys Gin 
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teg gee aag gee gca gag get agg agt gtg gga gtg tgc ttt gcg gac 1245 

Ser Ala Lys Ala Ala Glu Ala Arg Ser Val Gly Val Cys Phe Ala Asp 

150 155 160 

gca agt gta egg gaa etc ggt gtt age gag ttt ctg gat aac gat ate 1293 

Ala Ser Val Arg Glu Leu Gly Val Ser Glu Phe Leu Asp Asn Asp He 

165 170 175 

tat tec aac ttt gag teg ctt att ate caa etc ggg gtg aag gag tgt 1341 

Tyr Ser Asn Phe Glu Ser Leu He He Gin Leu Gly Val Lys Glu Cys 

180 185 190 195 

ttg gtg cag atg gat get aat aag aag gat gtt gag ctg gga aag att 1389 

Leu Val Gin Met Asp Ala Asn Lys Lys Asp Val Glu Leu Gly Lys He 

200 205 210 

egg get att gec gat agt tgt ggg ate get ate tec gag agg ccg gtg 1437 

Arg Ala lie Ala Asp Ser Cys Gly He Ala He Ser Glu Arg Pro Val 

215 220 225 
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get gat tat ggt guc aag gat att gag cag gat ctg acg agg ttg ttg 
Ala Asp Tyr Gly Val Lys Asp He Glu Gin Asp Leu Thr Aro Leu Leu 
220 235 240 



1485 



agg gat gaa egg " zq get ggt acg ctg ccg cag acg gag eta aag czz 1533 

Arg Asp Glu Arg fer Ala Gly Thr Leu Pro Gin Thr Glu Leu Lys Leu 
245 250 255 

gcg atg eg: teg gcg tct gcg ttg ate aag tac ctt ggg gtz atg acg 1581 

Ala Met Gly Ser .-.la Ser Ala Leu He Lys Tyr Leu Gly Val Met Thr 

260 265 270 275 

gat cct aca aac zzc ggc cag tac cag etc tat cag cat qaz ttg teg 1629 

Asp Pro Thr Asn ?he Gly Gin Tyr Gin Leu Tyr Gin His Asc Leu Ser 

230 285 " 290 

cag ttt atg aag zzq gat teg teg gcg ctg cgt get ctt aac ctt atg 1677 

Gin Phe Mez Lys Leu Asp Ser Ser Ala Leu Arg Ala Leu Asn Leu Met 
295 300 305 

cct ggt ccg egg gsc gga teg aag tct atg agt ttg ttt ggt ttg ttg 1725 

Pro Gly Pre Arg Asp Gly Ser Lys Ser Met Ser Leu Phe Gly Leu Leu 
31C 315 320 

aat cac tgc aag aec cct gtt ggt age egg ttg ctt gcg cag tgg ctg 1773 

Asn His Cys Lys Thr Pro Val Gly Ser Arg Leu Leu Ala Gin Trp Leu 
325 330 335 

aaa cag ccg ttg azg gat ctg gcg gag ate gag aag aga cag cag ctt 1821 

Lys Gin Pre Leu Met Asp Leu Ala Glu He Glu Lys Arg Gin Gin Leu 

340 345 350 355 

gtt gag gcg ttt get gtt aac acg gag etc aga cag act atg cag gag 1869 

Val Glu Ala Phe Val Val Asn Thr Glu Leu Arg Gin Thr Met Gin Glu 

360 365 370 

gag cat ctt cgc tec ata ccg gat ctg tat aga eta gcg aag egg ttc 1917 

Glu His Leu Arg Ser He Pro Asp Leu Tyr Arg Leu Ala Lys Arg Phe 
3^5 380 385 



cag cgc aaa cag oca aac ztg gaa gac gtt gtg egg gtg tac cag gtt 
Gin Arg Lys Gin Ala Asn Leu Glu Asp Val Val Arg Val Tyr Gin Val 
35C 395 400 



gtt gac ctt gat gcg ctg gag aac cac gag ttc ate ate aag cct gag 
Val Asp Leu *Asp Ala Leu Glu Asn His Glu Phe He He Lys Pro Glu 
455 460 465 



1965 



get att eg- ttg ggt ttt gtc aac tct etc gag aat gtt ata gat 2013 

Ala He Arc Leu Pro Gly Phe Val Asn Ser Leu Glu Asn Val Met Asp 

405 410 415 

gaa gag taz cag acg ccc ctg gag acg gag tat act tec aac etc egg 2061 

Glu Glu Tyr Gin Thr Pro Leu Glu Thr Glu Tyr Thr Ser Asn Leu Arg 
420 425 430 435 

agt cac tcz gat age tta gcg aaa ctg gag gag atg gtt gag act acg 2109 

Ser His Ser Asp Ser Leu Ala Lys Leu Glu Glu Met Val Glu Thr Thr 

440 445 450 



2157 
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ttt gac gag agt etc egg ate ate agg aag aag ctg gac aag etc cgt 2205 

Phe Asp Glu Ser Leu Arg lie lie Arg Lys Lys Leu Asp Lys Leu Arg 

470 475 480 

cat gat atg ggc czz cag cac cgc agg gta get egg gac ctt gac caa 2253 

His Asp Met Gly Val Glu His Arg Arg Val Ala Arg Asp Leu Asp Gin 

485 490 495 

gat att gag aag eac ttg ttc ctg gag aac cac agg gtg cac gga tgg 2301 

Asp lie Glu Lys Lys Leu Phe Leu Glu Asn His Arg Val His Gly Trp 
500 505 510 ' 515 

tgc ttc cga czz act cgc aac gag teg gga tgc ate cgc aat aag aga 2349 

Cys Phe Arg Leu Thr Arg Asn Glu Ser Gly Cys He Arg Asn Lys Arg 

5 2C 525 530 

gag tac cag gaa zqz zct aca cag aag aac ggt gtc tac ttc act acg 2397 

Glu Tyr Gin Glu Cys Ser Thr Gin Lys Asn Gly Val Tyr Phe Thr Thr 

535 540 545 

teg act atg caa acc ttg cgc egg gag cat gat caa ctg tec teg aac 2445 

Ser Thr Met Gin Thr Leu Arg Arg Glu His Asp Gin Leu Ser Ser Asn 

550 555 560 

tac aat aga act cag acc ggc ctg gtg aat gag gtc gtt aac gtt gee 24 93 

Tyr Asn Arg Thr Gin Thr Gly Leu Val Asn Glu Val Val Asn Val Ala 

565 570 575 

gcg tec tac tgt cct gtt ttg gaa cga ctt gec ggt gtc ata gca cac 2541 

Ala Ser Tyr Cys Pro Val Leu Glu Arg Leu Ala Gly Val He Ala His 
580 585 590 595 

etc gat gtc att gta age ttc get cat get tct gtt cat gcg ccg acc 2589 

Leu Asp Val He Val Ser Phe Ala His Ala Ser Val His Ala Pro Thr 

600 605 610 

ccc tat get egg ccc aag atg cac cca cga ggc acc gga aac aca gtt 2637 

Pro Tyr Ala Arg Pro Lys Met His Pro Arg Gly Thr Gly Asn Thr Val 

615 620 625 

etc aag gaa gcg cgc cac ccc tgt atg gaa atg cag gat gat att tea 2685 

Leu Lys Glu Ala Arg His Pro Cys Met Glu Met din Asp Asp He Ser 

630 635 640 

ttc att act aat gat gtg get ttg gtc cga gac gag tec tec ttc etc 2733 

Phe He Thr Asn Asp Val Ala Leu Val Arg Asp Glu Ser Ser Phe Leu 

645 650 655 

ate att act get cct aac atg gga ggt aaa teg act tat att cgc caa 2781 

He He Thr Gly Pro Asn Met Gly Gly Lys Ser Thr Tyr He Arg Gin 
660 665 670 675 

att ggt gtt b:c get etc atg get cag acg ggc tgc ttt gtg cct tgt 2829 

He Gly Val He Aia Leu Met Ala Gin Thr Gly Cys Phe Val Pro Cys 

680 685 690 

aca gaa gca gaa ttg acc ate ttt gac tgt ate ctt gca cgt gtt ggt 2877 

Thr Glu Ala Glu Leu Thr He Phe Asp Cys He Leu Ala Arg Val Gly 
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c?£ 700 705 

gca agt zaz ::: rag etc aag gga gtt tec act ttc atg get gag atg 2925 
Ala Ser Asp S-r Gin Leu Lys Gly Val Ser Thr Phe Met Ala Glu Met 
"I-" 715 720 

etc gaa =:= zzz aac ate etc aag teg gca acg tec gag tct ctt ate 2973 
Leu Glu Thr Ser Asn He Leu Lys Ser Ala Thr Ser Glu Ser Leu He 
725 730 735 

ate ate gac gag ctt ggg cgc ggt aca age acg tat gac gga ttc ggc 3021 
He He As? Glu Leu Gly Arg Gly Thr Ser Thr Tyr Asp Gly Phe Gly 
740 745 750 755 

eta gca tgg gec ate tct gaa cac ate gtc aca gag att cgt tgc ttc 3069 
Leu Ala Trp Ala He Ser Glu His He Val Thr Glu He Arg Cys Phe 

"60 765 770 

ggc ctt t.c cc: act cac ttc cat gaa ttg aca get etc gee gat cga 3117 
Gly Leu Phe Ala Thr His Phe His Glu Leu Thr Ala Leu Ala Asp Arg 
775 780 785 

tac ccc aag zzz zzz aag aac ctg cac gta gtc gec ttc ate ggc gat 3165 
Tyr Pro Lys Ser Val Lys Asn Leu His Val Val Ala Phe He Gly Asp 
790 795 800 

ggt act gat gat gac agt gaa gat aag aag tec aag egg aac cag gtc 3213 
Gly Thr Asp Asp Asp Ser Glu Asp Lys Lys Ser Lys Arg Asn Gin Val 
805 810 815 

act ctt ctg tac egg gtc gaa cct ggc att tgt gac cag tea ttc ggt 3261 
Thr Leu Leu Tyr Arg Val Glu Pro Gly He Cys Asp Gin Ser Phe Gly 
820 825 830 835 

ate cac gtn gec gaa ttg gtc cgc ttc ccg gag aag gtg gtc aac atg 3309 
He His Val Ala Glu Leu Val Arg Phe Pro Glu Lys Val Val Asn Met 

340 845 850 

gec cgc cag aag gca gag gaa ctt gaa gac ttt act tea tec gaa cag 3357 
Ala Arg Gin Lys Ala Glu Glu Leu Glu Asp Phe Thr Ser Ser Glu Gin 
355 860 865 

caa gac cag cag z za tec atg gcg ate gat aaa tac teg cag gaa gaa 3405 
Gin Asp Gin Gin Ser Ser Met Ala He Asp Lys Tyr Ser Gin Glu Glu 
870 875 880 

gtt gag gag ggc agt gec ctt etc aaa gcg atg ctg ctg aaa tgg aag 3453 
Val Glu Glu Gly Ser Ala Leu Leu Lys Ala Met Leu Leu Lys Trp Lys 
885 890 895 

teg gag acc gag tec tct ggt aag gag ttg aca gtg gaa gag aag cga 3501 
Ser Glu Thr Glu Ser Ser Gly Lys Glu Leu Thr Val Glu Glu Lys Arg 
900 905 910 915 

cag ate atg cgt gat etc gtc aaa gca gat gag aag ctg caa gca aac 3549 
Gin lie Mez Arg Asp Leu Val Lys Ala Asp Glu Lys Leu Gin Ala Asn 

520 925 930 

aag gtc ttc cag ggt ate aag get tta tag attagtattt gcgtcttttt 3599 
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Lys Val Phe Gin Gly lie Lys Ala Leu 
935 940 

tctttctcgg ggtcazagcg gttcggcgtt tggaaggtgt caatctgtgt atgtgtgatc 3659 

tacggacatg aggataaaat gtgtagggaa taatattatc caaaaatttt cgagtgattg 3719 

cttctttgga catatecctt aggattaaag tggaaaggga gaaatcccat tcaactatat 3779 

cgacataagt caccctgaga tcgcgagtct agacgctcac cggg 3823 

<2i0> 2 
<211> 940 
<212> PRT 

<213> Aspegilius cryzae 
<400> 2 

Met Ser Ser Arg Pre Glu Leu Lys Val Asp Asp Glu Val Gly Phe He 

15 10 15 

Arg Phe Tyr Arg Ser Leu Ala Ala Asn Ser Asn Asp Glu Thr He Arg 

20 25 30 

Val Phe Asp Arg Gly Asp Trp Tyr Ser Ala His Gly Ala Lys Ala Glu 

35 40 45 

Phe He Ala Arg Thr Val Tyr Lys Thr Thr Ser He Leu Arg Asn Leu 

50 55 60 

Gly Arg Ser Asp Ser Gly Gly Leu Pro Ser Val Thr Met Ser Val Thr 
65 70 75 80 

Val Phe Arg Asn Phe Leu Arg Glu Ala Leu Phe Arg Leu Asn Lys Arg 

85 90 95 

He Glu He Trp Gly Ser Val Gly Thr Gly Lys Gly His Trp Lys Leu 

100 105 110 

Val Lys Gin Ala Ser Pro Gly Asn Leu Gin Asp Val Glu Glu Glu Leu 

115 120 125 

Gly Ser Val Gly Gly Leu Ser Met Asp Ser Ala Pro He lie Leu Ala 

130 135 140 

Val Lys He Ser Ala Lys Ala Ala Glu Ala Arg Ser Val Gly Val Cys 
145 150 155 160 

Phe Ala Asp Ala Ser Val Arg Glu Leu Gly Val Ser Glu Phe Leu Asp 

165 170 175 

Asn Asp lie Tyr Ser Asn Phe Glu Ser Leu lie lie Gin Leu Gly Val 

180 185 190 

Lys Glu Cys Leu Val Gin Met Asp Ala Asn Lys Lys Asp Val Glu Leu 

195 200 205 

Gly Lys lie Arg Ala lie Ala Asp Ser Cys Gly He Ala He Ser Glu 

210 215 220 

Arg Pro Val Ala Asp Tyr Gly Val Lys Asp lie Glu Gin Asp Leu Thr 
225 23C 235 240 

Arg Leu Leu Arg Asp Glu Arg Ser Ala Gly Thr Leu Pro Gin Thr Glu 

2^ 250 255 

Leu Lys Leu Ala Met Gly Ser Ala Ser Ala Leu lie Lys Tyr Leu Gly 

260 265 270 

Val Met Thr Asp Pro Thr Asn Phe Gly Gin Tyr Gin Leu Tyr Gin His 

275 280 285 

Asp Leu Ser Gin Phe Met Lys Leu Asp Ser Ser Ala Leu Arg Ala Leu 

290 295 300 

Asn Leu Met Pro Gly Pro Arg Asp Gly Ser Lys Ser Met Ser Leu Phe 
305 310 315 320 

Gly Leu Leu Asn His Cys Lys Thr Pro Val Gly Ser Arg Leu Leu Ala 
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325 










330 










335 




Gin 


Trp 


Leu 


L v s 


Gin 


Pro 


Leu 


Met 


Asp 


Leu 


Ala 


Glu 


lie 


Glu 


Lys 


Arg 


















345 










350 




Gin 


Gin 


T ft:- 


V Cl _ 


Glu 


Ala 


Phe 


Val 


Val 


Asn 


Thr 


Glu 


Leu 


Arg 


Gin 


Thr 






* — — 










360 










365 








Met 


Gin 


Glu 


G-.U 


His 


Leu 


Arg 


Ser 


lie 


Pro 


Asp 


Leu 


Tyr Arg 


Leu 


Ala 




370 










375 










380 










Lys 


Arg 


Phe 


Gin 


Arg 


Lys 


Gin 


Ala 


Asn 


Leu 


Glu 


Asp 


Val 


Val 


Arg 


Val 


385 










390 










395 








400 


Tyr Gin 


Val 


Ala 


He 


Arg 


Leu 


Pro 


Gly 


Phe 


Val 


Asn 


Ser 


Leu 


Glu 


Asn 










405 










410 










415 




Val 


Met 


Asp 


Glu 


Glu 


Tyr 


Gin 


Thr 


Pro 


Leu 


Glu 


Thr Glu Tyr 


Thr 


Ser 








420 










425 










4 30 






Asn 


Leu 


Arg 


Ser 


His 


Ser 


Asp Ser 


Leu 


Ala 


Lys 


Leu 


Glu 


Glu 


Met 


Val 






435 










440 










445 








Glu 


Thr 


Thr 


Vai 


Asp 


Leu Asp Ala 


Leu 


Glu 


Asn 


His 


Glu 


Phe 


lie 


lie 




450 










455 










4 60 










Lys 


Pro 


Glu 


Phe 


Asp 


Glu 


Ser 


Leu 


Arg 


He 


lie 


Arg 


Lys 


Lys 


Leu 


Asp 


465 










470 










475 










480 


Lys 


Leu 


Arg 


His 


Asp 


Met 


Gly 


Val 


Glu 


His 


Arg 


Arg 


Val 


Ala 


Arg Asp 










485 










490 










495 




Leu 


Asp 


Gin 


Asp 


He 


Glu 


Lys 


Lys 


Leu 


Phe 


Leu 


Glu 


Asn 


His 


Arg 


Val 








500 










505 










510 




His 


Gly 


Trp 


Cys 


Phe 


Arg 


Leu 


Thr 


Arg 


Asn 


Glu 


Ser 


Gly 


Cys 


lie 


Arg 






515 










520 










525 




Asn 


Lys 


Arg 


Glu 


Tyr 


Gin 


Glu 


Cys 


Ser 


Thr 


Gin 


Lys 


Asn 


Gly Val 


Tyr 




530 










535 










540 








Phe 


Thr 


Thr 


Ser 


Thr 


Met 


Gin 


Thr 


Leu 


Arg 


Arg 


Glu 


His 


Asp 


Gin 


Leu 


545 










550 










555 








560 


Ser 


Ser 


Asn 


Tyr 


Asn 


Arg 


Thr 


Gin 


Thr 


Gly 


Leu 


Val 


Asn 


Glu 


Val 


Val 










565 










570 










575 




Asn 


Val 


Ala 


Ala 


Ser 


Tyr 


Cys 


Pro 


Val 


Leu 


Glu Arg 


Leu 


Ala 


Gly Val 








580 










585 










590 






He 


Ala 


His 


Leu 


Asp 


Val 


He 


Val 


Ser 


Phe 


Ala 


His 


Ala 


Ser 


Val 


His 






595 










600 










605 








Ala 


Pro 


Thr 


Pro 


Tyr 


Ala 


Arg 


Pro 


Lys 


Met 


His 


Pro 


Arg 


Gly 


Thr 


Gly 




610 










615 










620 






Asn 


Thr 


Val 


Leu 


Lys 


Glu 


Ala 


Arg 


His 


Pro 


Cys 


Met 


Glu 


Met 


Gin 


Asp 


625 










630 










635 










640 


Asp 


He 


Ser 


Fhe 


He 


Thr 


Asn 


Asp 


Val 


Ala 


Leu 


Val 


Arg 


Asp 


Glu 


Ser 










645 










650 










655 




Ser 


Phe 


Leu 


He 


lie 


Thr 


Gly 


Pro 


Asn 


Met 


Gly Gly Lys 


Ser 


Thr 


Tyr 


He 






660 










665 










670 




Arg 


Gin 


lie 


Gly 


Val 


He 


Ala 


Leu 


Met 


Ala 


Gin 


Thr Gly Cys 


Phe 






675 










680 










685 








Val 


Pro 


Cys 


Thr 


Glu 


Ala 


Glu 


Leu 


Thr 


He 


Phe 


Asp Cys 


He 


Leu 


Ala 




690 










695 










700 










Arg 


Val 


Gly 


Ala 


Ser 


Asp 


Ser 


Gin 


Leu 


Lys 


Gly Val 


Ser 


Thr 


Phe 


Met 


705 


Glu 








710 










715 










720 


Ala 


Met 


Leu 


Glu 


Thr 


Ser 


Asn 


He 


Leu 


Lys 


Ser 


Ala 


Thr 


Ser 


Glu 


Ser 








725 










730 










735 




Leu 


He 


lie 


He 


Asp 


Glu 


Leu 


Gly Arg 


Gly Thr Ser Thr 


Tyr 


Asp 


Gly 






740 










745 










750 


Phe 


Gly 


Leu 


Ala 


Trp Ala 


He 


Ser 


Glu 


His 


He 


Val 


Thr 


Glu 


lie 






755 










7 60 










765 






Arg Cys 


Phe 


Gly 


Leu 


Phe 


Ala 


Thr 


His 


Phe 


His 


Glu 


Leu 


Thr 


Ala 


Leu 




770 










775 










780 










Ala 


Asp Arg 


Tvr 


Pro 


Lys 


Ser 


Val 


Lys 


Asn 


Leu 


His 


Val 


Val 


Ala 


Phe 


785 










790 










795 










800 
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He 


Gly 


Asp 


Gly 


Th 


v- 


Asp 


Asp 


Asp 


Ser 


Glu 


Asp 


Lys 


Lys 


Ser 


Lys 


Arg 










30 












810 










815 




Asn 


Gin 


Val 


Thr 


le 




Leu 


Tyr 


Arg 


Val 


Glu 


Pro 


Gly 


He 


Cys 


Asp 


Gin 








820 












825 










830 






Ser 


Phe 


Gly 


He 


Hi 


s 


Val 


Ala 


Glu 


Leu 


Val 


Arg 


Phe 


Pro 


Glu 


Lys 


Val 






835 












840 










845 








Vai 


Asn 


Met 


Ala 


Ar 


z 


Gin 


Lys 


Ala 


Glu 


Glu 


Leu 


Glu 


Asp 


Phe 


Thr 


Ser 




850 












855 










860 










Ser 


Glu 


Gin 


Glr. 


As 


p 


Gin 


Gin 


Ser 


Ser 


Met 


Ala 


He 


Asp 


Lys 


Tyr 


Ser 


865 












870 










875 










880 


Gin 


Glu 


Glu 


» * — * 

v a x 


o _ 


- 


Glu 


Gly 


Ser 


Ala 


Leu 


Leu 


Lys 


Ala 


Met 


Leu 


Leu 






















890 










895 




Lys 


Trp 


Lys 


Ser 


Gl 




Thr 


Glu 


Ser 


Ser 


Gly 


Lys 


Glu 


Leu 


Thr 


Val 


Glu 








C «. r\ 












905 










910 






Glu 


Lys 


Arg 


Gin 


I i 




Met 


Arg 


Asp 


Leu 


Val 


Lys 


Ala 


Asp 


Glu 


Lys 


Leu 






915 












920 










925 








Gin 


Ala 


Asn 


Lys 


Va 




Phe 


Gin 


Gly 


He 


Lys 


Ala 


Leu 











930 935 940 



<210> 3 
<211> 38 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: oligo # U 
<400> 3 

gggaagctgc caggccccag tgtcagcctc ctatgctc 38 



<210> 4 
<211> 38 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: oligo # L-G.T 
<400> 4 

gagcatagga ggctgacatt ggggcctggc agcttccc 38 



<210> 5 
<211> 38 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description cf Artificial Sequence: oligo # 
L-G.A. 

<400> 5 

gagcatagga ggctgacaat ggggcctggc agcttccc 38 



<210> 6 



WO 00/50567 



9 



PCT/DKOO/00063 ~ 



<211> 38 
<212> DNA 

<2 1 3 > Artificial Sequence 
<22Z> 

<22 5> Descrip- icr. cf Artificial Sequence: oligo # 
L-G.G.* 

<40C> 6 

gagcaragga qqczzacaqt ggggcctggc agcttccc 38 

<21C> 7 
<21i> 38 
<212> DNA 

<213> Artificial Sequence 
<22C> 

<222> Descriptor, cf Artificial Sequence: oligo 8 
I.-A.C. 

<40C> 7 

gagcatagga ggczoacacc ggggcctggc agcttccc 38 

<210> 8 
<211> 40 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: oligo # L-TG 
<400> 8 

gagcatagga ggctgacact gtggggcctg gcagcttccc 40 

<21C> 9 
<21I> 38 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: oligo # L-HOM 
<400> 9 

gagcatagga ggcTigacacc ggggcctggc agcttccc 38 

<210> 10 
<211> 24 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description cf Artificial Sequence: oligo # PR 
117858 



<40C> 10 
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ggcncarath ggntgyttyg tncc 

<210> II 

<21i> 20 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description, of Artificial 
117859 

<400> 11 

gcccangcna rnccraancc 

<210> 12 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial 
<400> 12 

gctcgaaaca tccaacatcc 

<210> 13 
<211> 19 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial 
<400> 13 

gctgtgaatc acttgcacc 

<210> 14 
<211> 25 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial 
<400> 14 

cttcataaac tgcgacaaat catgc 

<210> 15 
<211> 16 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial 



Sequence: oligo # Pr 

20 

Sequence: oligo # 130740 

20 

Sequence: oligo # 130741 

19 

Sequence: oligo # 131928 

25 

Sequence: oligo # 131929 
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<400> 15 

ggaggagca- zzzcqc 16 

<210> 16 

<211> 25 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: oligo # 131930 

<400> 16 

ggaactcgaa ;ac:;tactt catcc 25 

<210> 17 
<211> 19 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: oligo # 134608 
<400> 17 

ccagaaactc gctaacacc 19 

<210> 18 
<211> 15 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: oligo # 
134605 

<400> 18 

gtgctttgcg gacgc !5 

<210> 19 
<211> 16 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: oligo # 134610 
<400> 19 

caggacagta ggacgc 16 

<210> 20 
<211> 17 
<212> DNA 

<213> Artificial Sequence 
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<220> 

<223> Description of Artificial Sequence: oligo # 135320 
<400> 2C 

cgagcgatga act zzqc 17 



<210> 21 
<211> 17 
<212> DNA 

<213> Art i f icial Sequence 
<220> 

<223> Description of Artificial Sequence: oligo # 135321 
<400> 21 

gcgttggtgg atta::c 17 



<210> 22 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: oligo # 136105 
<400> 22 

cgttgcatct atcaratacc 20 



<210> 23 
<211> 21 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: oligo # 
136106 

<400> 23 

ggtatatgat agatccaacg c 

<210> 24 
<211> 24 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: oligo ff 
137208 

<400> 24 

ccgcgtctcc aacaagatga atgg 



<210> 25 
<211> 21 
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<2I2> DNA 

<215> Artificial Sequence 
<220> 

<222> Descriptirr. of Artificial Sequence: oligo # 137207 
<403> 25 

ccgctttctc gggg~catag c 21 

<21C> 26 

<211> 22 

<212> DNA 

<213> Artificial Sequence 
<22C> 

<223> Descript ic:\ of Artificial Sequence: primer 138149 

<400> 26 

cctttccacr ttaatcctaa gc 22 

<210> 27 
<211> 35 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: oligo # 
000120j2 

<400> 27 

tctgcgaatc gcttggar.cc cgaacgcgac aacac 35 

<21C> 28 
<211> 33 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Descriptior: of Artificial Sequence: oligo # 
000120j4 

<400> 28 

gagctcagat ctcttacgtt ctggacgaga aga 33 

<210> 29 
<211> 35 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description cf Artificial Sequence: oligo ii 
000120j3 

<400> 29 

gttgtcgcgt -cggga.cca agcgattcgc agaag 35 



\VO 00/50567 



14 



PCT/DK00/00063 ~ 



<2I0> 30 

<2il> 33 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: oligo # 
1298-TAKA 



<400> 30 

gcaagcgcgc gc==tacatg gtgttttgat cat 



33 
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